Mutants of Green Fluorescent Protein

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention is in the fields of molecular and cellular biology. More 
particularly, the invention is directed to mutants of the genes encoding Green 
Fluorescent Protein (GFP) and the proteins encoded by these mutants. The 
mutant GFPs are used to allow detection of eukaryotic and prokaryotic cells 
transfected or transformed with extrinsic genes, and to label proteins of interest 
to facilitate their localization within viable cells. 

Related Art 

Transfection of Foreign Genes 

To study the function of a gene, a technique that is commonly employed 
is the transfer of the gene into a new cellular environment. This process, called
"transfection," provides several advantages to the genetic scientist. For example, 
the cellular protein encoded by the gene can often be more easily studied by 
transferring the gene into a cell or organism that normally does not produce the 
protein, and then examining the effect of this protein on the host cell. The 
existence and function of regulatory genetic sequences (e.g., promoters, inhibitors 
and enhancers) may be elucidated by transfection of foreign genes into cells 
containing the regulatory sequences. The transfer of non-native or altered genes 
into a host cell also allows for large-scale production of the proteins encoded by 
the genes, a process upon which much of the current biotechnology industry is 
based. Transfection of plant embryos with foreign genes has provided genetically 
engineered plants that are more resistant to adverse environmental conditions or 
that are more nutritionally rich. Finally, gene transfer methods allow the 
introduction of new or mutated genes into whole organisms. This latter capability 
provides the opportunity for the construction of stable models of mammalian
diseases, for large-scale production of proteins in the milk of transgenic lactating
animals, and for the possibility of genetic therapy for certain diseases. 

A variety of techniques has been used to transfect non-native genes into
cells (reviewed in Sambrook, J., et al., Molecular Cloning, a Laboratory Manual,
2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, pp.
16.30-16.55 (1989); Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York:
W.H. Freeman and Co., pp. 213-234 (1992)). These techniques include biological
methods such as the use of viruses (e.g., adenovirus or certain retroviruses for
mammalian cells, baculovirus for insect cells and bacteriophages for bacterial cells) 
or bacteria (e.g., Agrobacterium for plant cells), chemical methods such as 
calcium phosphate precipitation, DEAE-dextran-mediated endocytosis or 
liposome-mediated transfection, and physical methods such as electroporation or 
direct microinjection. For transfection of mammalian cells, the techniques most 
commonly employed currently are virus-mediated transfection, lipofection and 
electroporation. 

Detection of Gene Transfer 

Regardless of the method used, however, simply attempting to transfect 
a cell does not guarantee that a majority (or even any) of the target cells will take 
up and/or express the exogenous DNA. Indeed, it has been suggested that the
success rate of even the most optimal transfection techniques in achieving
stable transfer of exogenous DNA is far less than 1% (Watson, J.D., et al.,
Recombinant DNA, 2nd Ed., New York: W.H. Freeman and Co., pp. 216, 218
(1992)). Thus, it is usually critical to determine which target cells have received 
and/or incorporated the gene(s) being transfected, for which a number of 
methodologies have been used. 



Expression 

The most obvious of these methods is to simply examine the target cells 
for expression of the exogenous gene. In this method, the transfected cells are 
grown in vitro and assayed for the presence of the protein encoded by the 
transferred gene. These assays are usually accomplished using immunological 
techniques such as Western blotting, ELISA or RIA. This type of technique is 
only useful, however, if the protein is produced in relatively high amounts 
(generally at the microgram level or above) and if suitable antibodies are available, 
neither of which is the case for some transfected genes. 

In those cases where protein expression cannot be examined, incorporation 
of exogenous genes can be determined by assaying the target cells for production 
of the mRNAs corresponding to the transferred genes. One very common 
technique for this determination is Northern blotting (Alwine, J.C., et al., Proc.
Natl. Acad. Sci. USA 74:5350-5354 (1977)), in which RNA molecules are isolated
from cells, separated by gel electrophoresis and electroblotted onto a solid support 
(e.g., nitrocellulose or nylon). The solid support is then overlaid with 
radiolabelled cDNAs corresponding to the transfected gene, which hybridize on 
the solid support to their complementary mRNAs. After exposing the blot to 
photographic film, the samples containing the expressed transgene are easily 
determined. While this method is more sensitive than those directly measuring
protein expression, Northern blotting still relies on actual expression of the gene
by the target cells, which is not always the case. 

Selection 

Another method for determining gene transfer, alternative to directly 
measuring gene expression, is to examine the effect of the gene on the transfected 
cells. For example, some transfected genes will confer upon their host cells the 
ability to grow in selective culture media or under some other environmental stress 
which non-transfected cells cannot tolerate. Genes of interest are often engineered
into sequences conferring, for example, antibiotic resistance upon the recipient
cells. Transfectants with these constructs will thus carry not only the gene of 
interest but also the antibiotic resistance gene which allows them to grow in 
antibiotic-containing media. Since non-transfected cells will not possess this 
resistance, any cell able to grow in media containing antibiotic will contain the 
resistance marker (the so-called "selectable marker") and the transgene that is 
linked to it. Selectable markers commonly used in such an approach are the 
neomycin (neo), ampicillin (amp) and hygromycin (hyg) resistance genes.

In the same way, selectable markers conferring on the transfected cells a 
metabolic advantage (e.g., ability to grow in nutrient-deficient media) have been 
used successfully. Examples of these types of selectable markers include 
thymidine kinase (Bacchetti, S., and Graham, F.L., Proc. Natl. Acad. Sci. USA
74:1590-1594 (1977); Wigler, M., et al., Cell 11:223-232 (1977)) and xanthine-
guanine phosphoribosyltransferase (Mulligan, R.C., and Berg, P., Proc. Natl.
Acad. Sci. USA 78:2072-2076 (1981)), which impart to their recipients the ability
to grow, using metabolic rescue pathways encoded by the marker genes, in media 
that inhibit vital metabolic pathways in non-transfected cells. Again, any cells able 
to grow in such media will contain the transgene linked to the marker gene. 

Selection methods such as these often require weeks of culturing of the 
cells, continuously under selective pressure, to provide a relatively pure population
of stable transfectants. Many uses of transfected cells, however, are conducted 
within hours of transfection, far too soon to determine transfection success using 
either the expression or selection methods described above. These types of 
applications are facilitated by a third approach - the use of "reporter genes". 

Reporter Genes 

Reporter genes are analogous to selectable markers in that they are co- 
transfected into recipient cells with the gene of interest, and provide a means by 
which transfection success may be determined. Unlike selectable markers,
however, reporter genes typically do not confer any particular advantage to the
recipient cell. Instead, reporter genes, as their name implies, indicate to the
observer (via some phenotypic activity) which cells have incorporated the reporter
gene and thus the gene of interest to which it is linked. A number of reporter 
genes have been used, including those operating by biochemical or fluorescent 
mechanisms, each with its own advantages and limitations. 

Biochemical Reporter Genes 

Some commonly used reporter genes encode enzymes or other 
biochemical markers which, when active in the transfected cells, cause some 
visible change in the cells or their environment upon addition of the appropriate 
substrate. Two examples of this type of reporter sequence are the E. coli genes
lacZ (encoding β-galactosidase or "β-gal") and gusA or uidA (encoding
β-glucuronidase or "β-glu"); the former is often used as a reporter gene in animal
cells (Hall, C.V., et al., J. Mol. Appl. Genet. 2:101-109 (1983); Cui, C., et al.,
Transgenic Res. 3:182-194 (1994)), the latter in plant cells (Jefferson, R.A., Nature
342:837-838 (1989); Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York:
W.H. Freeman and Co., pp. 281-282 (1992); Hull, G.A., and Devic, M., Meth.
Mol. Biol. 49:125-141 (1995)). These bacterial sequences are useful as reporter
genes because the recipient cells, prior to transfection, express extremely low 
levels (if any) of the enzyme encoded by the reporter gene. When transfected cells 
expressing the reporter gene are incubated with an appropriate substrate (e.g.,
X-gal for β-gal or X-gluc for β-glu), a colored or fluorescent product is formed
which can be detected and quantitated histochemically or fluorimetrically. 

Another often-used reporter gene is the bacterial gene encoding 
chloramphenicol acetyltransferase (CAT), which catalyzes the addition of acetyl 
groups to the antibiotic chloramphenicol (Gorman, C.M., et al., Mol. Cell.
Biol. 2:1044-1051 (1982); Neumann, J.R., et al., BioTechniques 5:444-446
(1987); Eastman, A., BioTechniques 5:120-132 (1987); Felgner, P.L., et al., Ann.
N.Y. Acad. Sci. 772:126-139 (1995)). After transfection, recipient cells are lysed
and the lysates are incubated with radiolabeled chloramphenicol and an acetyl
donor such as acetyl-CoA, or with unlabeled chloramphenicol and radiolabeled
acetyl-CoA (Sleigh, M.J., Anal. Biochem. 156:251-256 (1986)). If expressed in
the cells, CAT transfers acetyl groups to chloramphenicol, which is then easily 
assayed by chromatographic techniques, thereby giving an indication of the 
incorporation of the co-transfected gene of interest by the recipient cells. 

Using reporter genes in this way, populations of cells, or even single cells, 
can be rapidly assayed for their incorporation of the exogenous gene linked to the 
reporter gene. Since they do not rely directly on the expression of the gene of 
interest, assays of transfection success using reporter genes are usually simpler and 
more sensitive than those measuring mRNA or protein production from the 
transgene (Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York: W.H.
Freeman and Co., p. 155 (1992)). However, the use of these reporter genes is
severely limited in that it usually requires sacrifice (fixation) of the cells prior
to assay, and therefore cannot be used for assaying living cells or cultures. Thus,
alternative means for determining the incorporation of the transgene in viable
cells have been
developed. 

Fluorescent Reporter Genes 

One class of reporter genes suitable for use in viable cells, and rapidly gaining
widespread use, is fluorescence-based. These genes encode proteins which are
either naturally fluorescent or which convert a substrate from nonfluorescent to
fluorescent. Assays using this type of reporter gene are non-destructive and,
owing to the availability of sophisticated fluorescence detection systems, are often
more sensitive than biochemical reporter gene assays. 

One example of a fluorescence reporter gene is the luciferin-luciferase 
system (Bronstein, I., et al., Anal. Biochem. 219:169-181 (1994)). This system
utilizes the gene for luciferase, an ATPase enzyme isolated from fireflies (Gould,
S.J., and Subramani, S., Anal. Biochem. 175:5-13 (1988)) and other beetles
(Wood, K.V., et al., J. Biolumin. Chemilumin. 4:289-301 (1989)), or from certain
bioluminescent bacteria (Stewart, G.S., and Williams, P., J. Gen. Microbiol.
138:1289-1300 (1992); Langridge, W., et al., J. Biolumin. Chemilumin. 9:185-
200 (1994)). For use as a reporter gene, the luciferase gene is placed into a vector
also containing the gene of interest, or separate vectors containing the luciferase
gene and the gene of interest are mixed together. Cells are then transfected with
the vector(s) and treated with the luciferase substrate luciferin, which is rendered
luminescent (and impermeant) intracellularly by the action of the luciferase. Cells
containing the luciferase gene, and thus the gene of interest linked to it, can then 
be rapidly and sensitively observed using luminescence detectors such as 
luminometers. 

To provide a further increase in sensitivity, attempts have been made to 
use genes from certain cyanobacteria which encode naturally fluorescent 
phycobiliproteins such as phycoerythrin and phycocyanin. These proteins are 
among the most highly fluorescent known (Oi, V.T., et al., J. Cell Biol. 93:981-
986 (1982)), and systems have been developed that are able to detect the
fluorescence emitted from as little as one phycobiliprotein molecule (Peck, K., et
al., Proc. Natl. Acad. Sci. USA 86:4087-4091 (1989)). Phycobiliproteins also
have the advantage of being naturally fluorescent, thus eliminating the time-
consuming steps of the addition of exogenous substrates for their detection as is
required for luciferase and biochemical reporter genes. However, the
phycobiliproteins have proven extremely difficult to engineer into gene constructs
in such a way as to maintain their fluorescence (Heim, R., et al., Proc. Natl. Acad.
Sci. USA 91:12501-12504 (1994)), and thus are not commonly used as reporter
genes in assaying the transfection of mammalian cells. 

Thus, the ideal reporter gene would encode a naturally fluorescent protein 
(for ease of use following transfection) that is highly fluorescent (for increased 
sensitivity) and easily engineered (for maintenance of fluorescence). Such a
system has recently been developed, using the Green Fluorescent Proteins (GFPs)
isolated from certain marine cnidarians. 

GFP 

Overview 

GFPs are involved in bioluminescence in a variety of marine invertebrates, 
including jellyfish such as Aequorea spp. (Morise, H., et al., Biochemistry
13:2656-2662 (1974); Prendergast, F.G., and Mann, K.G., Biochemistry 17:3448-
3453 (1978); Ward, W.W., Photochem. Photobiol. Rev. 4:1-57 (1979)) and the sea
pansy Renilla reniformis (Ward, W.W., and Cormier, M.J., Photochem.
Photobiol. 27:389-396 (1978); Ward, W.W., et al., Photochem. Photobiol.
31:611-615 (1980)). The GFP isolated from Aequorea victoria has been cloned
and the primary amino acid structure has been deduced (Figure 1; Prasher, D.C.,
et al., Gene 111:229-233 (1992)) (SEQ ID NOs:1, 2). The chromophore of A.
victoria GFP is a hexapeptide composed of amino acid residues 64-69 in which
the amino acids at positions 65-67 (serine, tyrosine and glycine) form a
heterocyclic ring (Prasher, D.C., et al., Gene 111:229-233 (1992); Cody, C.W.,
et al., Biochemistry 32:1212-1218 (1993)). Resolution of the crystal structure of
GFP has shown that the chromophore is contained in a central α-helical region
surrounded by an 11-stranded β-barrel (Ormö, M., et al., Science 273:1392-1395
(1996); Yang, F., et al., Nature Biotech. 14:1246-1251 (1996)). Upon
purification, native GFP demonstrates an absorption maximum at 395 nanometers
(nm) and an emission maximum at 509 nm (Morise, H., et al., Biochemistry
13:2656-2662 (1974); Ward, W.W., et al., Photochem. Photobiol. 31:611-615
(1980)) with exceptionally stable and virtually non-photobleaching fluorescence
(Chalfie, M., et al., Science 263:802-805 (1994)).

While GFP has been used as a fluorescent label in protein localization and 
conformation studies (Heim, R., et al., Proc. Natl. Acad. Sci. USA 91:12501-12504
(1994); Yokoe, H., and Meyer, T., Nature Biotech. 14:1252-1256 (1996)), it has
gained increased attention in the field of molecular genetics since the
demonstration of its utility as a reporter gene in transfected prokaryotic and
eukaryotic cells (Chalfie, M., et al., Science 263:802-805 (1994); Heim, R., et al.,
Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Wang, S., and Hazelrigg, T.,
Nature 369:400-403 (1994)). GFP has also been used in fluorescence resonance
energy transfer studies of protein-protein interactions (Heim, R., and Tsien, R.Y.,
Curr. Biol. 6:178-182 (1996)). Since GFP is naturally fluorescent, exogenous
substrates and cofactors are not necessary for induction of fluorescence, thus
providing GFP an advantage over the biochemical, luminescent and other
fluorescent reporter genes described above. Visualization of GFP fluorescence
does not require the fixation steps necessary with biochemical reporters such as
β-gal and β-glu, nor does it require extraction from the cell prior to assay as may
be required with luciferase; thus, GFP is suitable for use in procedures requiring
continued viability of transfected cells. In addition, since the GFP cDNA
containing the complete coding region is less than 1 kilobase in size (Prasher,
D.C., et al., Gene 111:229-233 (1992)), it is easily manipulated and inserted into
a variety of vectors for use in creating stable transfectants (Chalfie, M., et al.,
Science 263:802-805 (1994)).

Despite these advantages, however, the use of wildtype GFP has a few 
limitations. For example, the excitation and emission maxima of wildtype GFP are 
not within the range of wavelengths of standard fluorescence optics (at which GFP 
demonstrates relatively low quantum yield (i.e., low intensity of fluorescence)). 
In addition, GFP shows low efficiency of transcription in mammalian cells upon 
transfection and is packaged into low-solubility inclusion bodies in bacteria (thus 
providing difficulty in purification). These limitations have been overcome to a 
limited extent via the introduction of selected point mutations into the sequence 
of wildtype GFP.

GFP Mutants

One of the earliest mutation studies of GFP, in which the tyrosine residue 
at position 66 in the wildtype protein ("wt-GFP") was replaced with a histidine 
residue, resulted in a mutant protein which fluoresced blue instead of green when 
excited with ultraviolet (UV) light (Heim, R., et al., Proc. Natl. Acad. Sci. USA
91:12501-12504 (1994)). This mutant protein not only provided a capacity for two
distinguishable wavelengths for use in studies comparing independent proteins and 
gene expression events, but also demonstrated that single point mutations in GFP 
could induce drastic changes in the photochemistry of the protein. Three other 
sets of specific point mutations have been shown to increase the excitation and 
emission maxima of GFP such that they fall well within the range of standard 
fluorescein optics (Ehrig, T., et al., FEBS Lett. 367:163-166 (1995); Delagrave,
S., et al., Bio/Technology 13:151-154 (1995); Heim, R., and Tsien, R.Y., Curr. Biol.
6:178-182 (1996)), thus permitting the use of GFP with standard laboratory 
fluorescence detection systems. The problem of low quantum yield by wt-GFP 
has been partially addressed by mutating the serine residue at position 65 to a 
threonine ("S65T"), either without (Heim, R., et al., Proc. Natl. Acad. Sci. USA
91:12501-12504 (1994)) or with (Cormack, B., et al., Gene 173:33-38 (1996)) a
concomitant mutation at position 64, or by mutating other residues in the non-
chromophore region (Crameri, A., et al., Nature Biotech. 14:315-319 (1996)).
The S65T mutation also appears to improve the rate of fluorophore formation in 
transfected cells by approximately four-fold over wt-GFP, thus allowing earlier 
and more sensitive detection of transfection with this mutant than with wt-GFP 
(Heim, R., et al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994)). By
combining the S65T mutation with a mutation at position 64 replacing 
phenylalanine with leucine, approximately 90% of the mutant GFP expressed in 
bacteria is soluble, thus improving protein purification and yields (Cormack, B., 
et al., Gene 173:33-38 (1996)). Another series of mutations results in a mutant
fusion GFP consisting of linked blue- and green-fluorescing proteins which have
proven useful in studies of protein localization, targeting and processing (Heim,
R., and Tsien, R.Y., Curr. Biol. 6:178-182 (1996)). Analogously, chimeric
constructs comprising GFP linked to other proteins have been used in studies of
ion channel expression and function (Marshall, J., et al., Neuron 14:211-215
(1995)), and in organelle targeting studies where they have provided a means for
selectively and distinctively labeling the organelles of living cells (Rizzuto et al.,
Curr. Biol. 6:183-188 (1996)). Finally, by combining the S65T mutation with
other mutations throughout the nonchromophore regions of the wt-GFP gene, a 
"humanized" mutant GFP (SEQ ID NOs:3, 4) has been produced that not only 
shows a significant increase in fluorescence intensity and rate of fluorophore 
formation over wt-GFP (via the S65T mutation) but also demonstrates a 22-fold 
increased expression efficiency in mammalian cells (Evans, K., et al., FOCUS
18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)). This
humanization was achieved via 92 base substitutions (in 88 codons) to the wt-GFP
gene which were amino acid-conservative and which were made to provide a
pattern of codon usage more closely resembling that of mammalian cells, as
opposed to the jellyfish codon patterns found in the wt-GFP gene which are less
efficiently translated in mammalian cells. A summary of these GFP chromophore
mutants is presented in Table 1.
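
By way of illustration only, the amino acid-conservative codon replacement described above may be sketched as follows. The sketch assumes the standard genetic code; the "preferred codon" choices and the short input fragment are hypothetical placeholders chosen for the example and are not the 92 substitutions of the humanized GFP gene.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical preferred codons (illustrative assumption, not data from the text).
PREFERRED = {"S": "AGC", "K": "AAG", "G": "GGC", "E": "GAG", "L": "CTG"}


def codons(seq):
    """Split a coding sequence into codons; a trailing partial codon is ignored."""
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def translate(seq):
    """Translate a coding sequence into one-letter amino acid codes."""
    return "".join(CODON_TABLE[c] for c in codons(seq))


def humanize(seq):
    """Swap each codon for the preferred synonymous codon, where one is listed."""
    return "".join(PREFERRED.get(CODON_TABLE[c], c) for c in codons(seq))


def count_changes(before, after):
    """Return (number of base substitutions, number of codons changed)."""
    base_changes = sum(a != b for a, b in zip(before, after))
    codon_changes = sum(a != b for a, b in zip(codons(before), codons(after)))
    return base_changes, codon_changes


fragment = "ATGAGTAAAGGAGAAGAACTT"  # hypothetical example fragment
recoded = humanize(fragment)
# The recoding must leave the encoded amino acid sequence unchanged.
assert translate(recoded) == translate(fragment)
print(recoded, count_changes(fragment, recoded))
```

Because every replacement is drawn from codons encoding the same amino acid, the translated sequence is unchanged while the base and codon substitution counts can be tallied in the same manner as the figures quoted above.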

Table 1. GFP Chromophore Mutants.

                      Amino Acid Residue Number:
Mutant                64      65      66      Reference*
------------------------------------------------------------------
(Wildtype)            Phe     Ser     Tyr     Prasher et al., 1992
GreenLantern-1        Phe     Thr     Tyr     Evans et al., 1996
Humanized S65T        Phe     Thr     Tyr     Zolotukhin et al., 1996
Y66H                  Phe     Ser     His     Heim et al., 1994
Y66W                  Phe     Ser     Trp
Y66F                  Phe     Ser     Phe
RSGFP1                Gly     Ser     Tyr     Delagrave et al., 1995
RSGFP2                Leu     Leu     Tyr
RSGFP3                Gly     Cys     Tyr
RSGFP4                Met     Gly     Tyr
RSGFP6                Val     Ala     Tyr
RSGFP7                Leu     Cys     Tyr
S65A                  Phe     Ala     Tyr     Heim et al., 1996
S65L                  Phe     Leu     Tyr
S65C                  Phe     Cys     Tyr
S65T                  Phe     Thr     Tyr
GFPmut1               Leu     Thr     Tyr     Cormack et al., 1996

* See preceding text for full citations.
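
As an informal aid to reading Table 1, the following sketch derives the conventional substitution names (e.g., "S65T") by comparing the residues at positions 64-66 of a mutant with those of the wildtype protein. Only a few rows of the table are reproduced; the dictionary layout and function name are assumptions made for this example.

```python
# Three-letter to one-letter codes for the residues appearing in Table 1.
THREE_TO_ONE = {
    "Ala": "A", "Cys": "C", "Gly": "G", "His": "H", "Leu": "L", "Met": "M",
    "Phe": "F", "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Wildtype residues at positions 64, 65 and 66 (first row of Table 1).
WILDTYPE = {64: "Phe", 65: "Ser", 66: "Tyr"}

# A few rows of Table 1, keyed by mutant name.
MUTANTS = {
    "Y66H": {64: "Phe", 65: "Ser", 66: "His"},
    "RSGFP4": {64: "Met", 65: "Gly", 66: "Tyr"},
    "S65T": {64: "Phe", 65: "Thr", 66: "Tyr"},
    "GFPmut1": {64: "Leu", 65: "Thr", 66: "Tyr"},
}


def substitutions(residues):
    """List substitutions relative to wildtype, e.g. ['F64L', 'S65T']."""
    names = []
    for position in sorted(WILDTYPE):
        wt, mutant = WILDTYPE[position], residues[position]
        if mutant != wt:
            names.append(f"{THREE_TO_ONE[wt]}{position}{THREE_TO_ONE[mutant]}")
    return names


for name, residues in MUTANTS.items():
    print(name, substitutions(residues))
```

Read in this way, GFPmut1 is seen to carry two substitutions (at positions 64 and 65), whereas Y66H and S65T each carry one.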



Despite some success in overcoming certain of the above-described 
limitations of GFPs, the sensitivity of GFP as a reporter gene (measured as 
percentage of positive cells) is not as high as that of standard biochemical reporter
genes such as β-gal (Evans, K., et al., FOCUS 18(2):40-43 (1996)). In addition,
the use of GFP as a reporter gene or a protein tag requires the use of fluorescent
excitation and emission optics, which increases user expense and which is more
technically challenging than the use of visible or white light optics often used with
standard reporters such as β-gal. Thus, a need currently exists for additional GFP
variants which are more highly fluorescent, humanized, rapidly expressed in
mammalian cells, capable of being observed using standard white light optics, and
which provide an increased level of sensitivity. 

SUMMARY OF THE INVENTION 

It is thus an object of the present invention to provide mutant GFP cDNAs
and proteins. In one aspect, the invention relates to such mutant GFP cDNAs
which, when transfected into prokaryotic (e.g., bacterial) or eukaryotic (e.g.,
mammalian) cells, increase the sensitivity of detection (measured as percentage or
number of positive cells). The present invention thus provides nucleic acid
molecules encoding mutant GFPs, wherein the mutant GFPs have an amino acid
sequence comprising an amino acid residue lacking an aromatic ring structure at
position 64 and an amino acid residue having a side chain no longer than two
carbon atoms in length at position 65. Preferably, (a) if the residue at position 64
is leucine then the residue at position 65 is not cysteine or threonine; (b) if the
residue at position 64 is valine then the residue at position 65 is not alanine; (c) if
the residue at position 64 is methionine then the residue at position 65 is not
glycine; and (d) if the residue at position 64 is glycine then the residue at position
65 is not cysteine. The invention is particularly directed to such nucleic acid
molecules encoding mutant GFPs wherein the amino acid residue at position 64
is alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine,
threonine, cysteine, asparagine, glutamine, aspartic acid or glutamic acid,
most preferably cysteine or methionine. The invention is also particularly directed
to such nucleic acid molecules encoding mutant GFPs wherein the amino acid
residue at position 65 is alanine, glycine, threonine, cysteine, asparagine or
aspartic acid, most preferably alanine. In particular, the invention provides
nucleic acid molecules encoding mutant GFPs wherein the amino acid at position
64 is cysteine or methionine and the amino acid at position 65 is alanine, and
nucleic acid molecules encoding mutant GFPs having an amino acid sequence as
set forth in either SEQ ID NO:5 or SEQ ID NO:6.
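
The residue criteria recited in the preceding paragraph can be expressed, purely as a non-limiting sketch, as a simple membership test. The residue sets below are copied from the lists given above for positions 64 and 65, and the excluded pairs from provisos (a) through (d); the function and variable names are assumptions introduced for this example.

```python
# Residues recited above for position 64 (three-letter codes).
POS64_ALLOWED = {
    "Ala", "Val", "Leu", "Ile", "Pro", "Met", "Gly", "Ser",
    "Thr", "Cys", "Asn", "Gln", "Asp", "Glu",
}

# Residues recited above for position 65.
POS65_ALLOWED = {"Ala", "Gly", "Thr", "Cys", "Asn", "Asp"}

# (position 64, position 65) pairs excluded by provisos (a) through (d).
EXCLUDED_PAIRS = {
    ("Leu", "Cys"), ("Leu", "Thr"),  # (a)
    ("Val", "Ala"),                  # (b)
    ("Met", "Gly"),                  # (c)
    ("Gly", "Cys"),                  # (d)
}


def satisfies_preferred_criteria(res64, res65):
    """True if the residue pair is in both recited lists and is not excluded."""
    return (
        res64 in POS64_ALLOWED
        and res65 in POS65_ALLOWED
        and (res64, res65) not in EXCLUDED_PAIRS
    )


print(satisfies_preferred_criteria("Cys", "Ala"))  # a most-preferred pairing
print(satisfies_preferred_criteria("Leu", "Thr"))  # excluded by proviso (a)
print(satisfies_preferred_criteria("Phe", "Ser"))  # wildtype residues
```

The check is intentionally limited to the explicitly listed residues; it does not attempt to compute side chain length or aromaticity from chemical structure.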

In additional aspects, the invention provides mutant GFPs encoded by any 
of the above-described nucleic acid molecules, vectors (particularly expression 
vectors) comprising these nucleic acid molecules, host cells (prokaryotic or 
eukaryotic (including mammalian)) comprising these nucleic acid molecules or 
vectors, and compositions comprising plasmid pGreenLantern-2/A1 or plasmid
pGreenLantern-2/A4. The invention also provides methods for producing a
mutant GFP, comprising culturing the above-described host cells under conditions 
favoring the production of a mutant GFP and isolating the mutant GFP from the 
host cell. The invention also provides mutant GFPs produced by these methods, 
particularly wherein the mutant GFPs emit fluorescent light when illuminated with
white light. The invention also relates to compositions comprising the above- 
described mutant GFPs. 

The invention is further directed to kits for transfecting a host cell with the 
nucleic acid molecules encoding the present mutant GFPs, such kits comprising 
at least one container containing a nucleic acid molecule encoding a mutant GFP 
such as those described above, which preferably comprises plasmid 
pGreenLantern-2/A1 or plasmid pGreenLantern-2/A4. These kits of the invention
may optionally further comprise at least one additional container containing a
reagent, preferably comprising a liposome and most preferably
LIPOFECTAMINE™, for delivering a mutant GFP nucleic acid molecule into a
host cell. 



The invention is further directed to kits for labeling a polypeptide with the 
present mutant GFPs, such kits comprising at least one container containing a 
mutant GFP such as those described above, preferably a mutant GFP having an 
amino acid sequence as set forth in SEQ ID NO:5 or SEQ ID NO:6. These kits
of the invention may optionally further comprise at least one additional container 
containing a reagent for covalently linking this mutant GFP to the target 
polypeptide. 

The fluorescence of all of the GFP mutants provided by the present 
invention is observable with fluorescein optics, making these mutant proteins 
amenable to use in techniques such as fluorescence microscopy and flow 
cytometry using standard FITC filter sets. In addition, the fluorescence of certain
of the present GFP mutants, particularly those having amino acid sequences as set
forth in SEQ ID NOs:5 and 6, is visible using standard white light optics (e.g.,
incandescent or fluorescent indoor lighting, or sunlight). The nucleic acid
molecules and mutant GFPs provided by the present invention thus contribute
improved tools for detection of transfection, for fluorescent labeling of proteins,
for construction of fusion proteins allowing examination of intracellular protein
expression, biochemistry and trafficking, and for other applications requiring the 
use of reporter genes. 

Other preferred embodiments of the present invention will be apparent to 
one of ordinary skill in light of the following drawings and description of the 
invention, and of the claims. 

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a depiction of the nucleotide (SEQ ID NO:1) and deduced
amino acid (SEQ ID NO:2) sequences of A. victoria Green Fluorescent Protein
cDNA (after Prasher, D.C., et al., Gene 111:229-233 (1992)).

Figure 2 is a depiction of the nucleotide (SEQ ID NO:3) and deduced
amino acid (SEQ ID NO:4) sequences of humanized A. victoria Green
Fluorescent Protein cDNA (after Zolotukhin, S., et al., J. Virol. 70:4646-4654
(1996)).

Figure 3 is a depiction of the amino acid sequence (SEQ ID NO:5) of the
A1 GFP mutant.

Figure 4 is a depiction of the amino acid sequence (SEQ ID NO:6) of the
A4 GFP mutant.

Figure 5 is a structural map of plasmid pGreenLantern-1.

Figure 6 is a structural map of plasmid pGreenLantern-2.

Figure 7 is a fluorescence photomicrograph of CHO-K1 cells viewed 24
hours after transfection with the A1 GFP mutant (plasmid pGreenLantern-2/A1).

Figure 8 is a fluorescence photomicrograph of CHO-K1 cells viewed 24
hours after transfection with the A4 GFP mutant (plasmid pGreenLantern-2/A4).

Figure 9 is a fluorescence photomicrograph of negative control CHO-K1
cells viewed 24 hours after transfection with the pGreenLantern-2 backbone.

Figure 10 is a bar graph demonstrating the fluorescence of CHO-K1 cells
determined by flow cytometry 24 hours after transfection with various GFP
mutants.

Figure 11 is a bar graph demonstrating the fluorescence of CHO-K1 cells
determined by flow cytometry 48 hours after transfection with various GFP
mutants.

Figure 12 is a structural map of plasmid pProEX HTb.

DETAILED DESCRIPTION OF THE INVENTION 

Overview 

The present invention provides nucleic acid molecules encoding mutant
GFPs, vectors and host cells comprising these nucleic acid molecules, the mutant
GFP polypeptides, and methods for producing mutant GFPs. Although specific
plasmids, vectors, promoters, selection methods and host cells are disclosed and
used herein and in the Examples, other promoters, vectors, selection methods and
host cells, both prokaryotic and eukaryotic, are well-known to one of ordinary
skill in the art and may be used to practice the present invention without departing
from the scope of the invention or any of the embodiments thereof.

In the present invention, GFPs with selective point mutations at amino acid
positions 64 and 65 have been constructed and analyzed. In general, it has been
discovered in the present invention that when the amino acid residue at position
64 (phenylalanine in wt-GFP) is mutated to an amino acid lacking an aromatic ring
(e.g., alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine,
threonine, cysteine, asparagine, glutamine, aspartic acid, glutamic acid, lysine,
arginine or histidine), an increase in fluorescence quantum yield is observed.
Increased fluorescence intensity is also observed when the amino acid residue at
position 65 (serine in wt-GFP) is mutated to an amino acid having a side chain
consisting of no more than two carbon atoms (e.g., alanine, glycine, threonine,
cysteine, asparagine or aspartic acid); such mutations induce a significant
"red-shift" in excitation maximum from ultraviolet to visible blue wavelengths
and a single excitation maximum instead of the dual excitation maximum of the
wildtype protein. Together, these general results indicate that in order to
construct GFP mutants with a dramatic increase in fluorescence intensity over
wt-GFP, either position 64 or position 65 should contain a reactive amino acid,
although particular amino acids appear to be preferred at each position as
described below. Furthermore, it has been unexpectedly discovered that several of
the mutant GFPs of the present invention, unlike those previously known in the
art, will emit fluorescence when illuminated by white light (e.g., incandescent or
fluorescent indoor lighting, or sunlight).
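
The two substitution criteria stated above can be written as a simple lookup.
The following is a minimal sketch only: the residue sets are copied from the
examples listed in the text, and the function name classify_mutant and the
dictionary-based result are illustrative assumptions, not part of the disclosed
methods.

```python
# Residue sets copied from the examples listed in the text (three-letter codes).
# They reproduce the text's lists, not an independent chemical classification.
NON_AROMATIC_AT_64 = {
    "Ala", "Val", "Leu", "Ile", "Pro", "Met", "Gly", "Ser", "Thr",
    "Cys", "Asn", "Gln", "Asp", "Glu", "Lys", "Arg", "His",
}
SMALL_SIDE_CHAIN_AT_65 = {"Ala", "Gly", "Thr", "Cys", "Asn", "Asp"}

# Wildtype residues at the two positions, as stated in the text.
WILDTYPE = {64: "Phe", 65: "Ser"}


def classify_mutant(res64: str, res65: str) -> dict:
    """Report which of the two stated substitution criteria a mutant meets."""
    return {
        "position_64_rule": res64 != WILDTYPE[64] and res64 in NON_AROMATIC_AT_64,
        "position_65_rule": res65 != WILDTYPE[65] and res65 in SMALL_SIDE_CHAIN_AT_65,
    }


print(classify_mutant("Cys", "Ala"))  # meets both criteria
print(classify_mutant("Phe", "Thr"))  # S65T: meets only the position-65 criterion
```

Meeting a criterion does not by itself predict brightness; Table 3 reports
mutants that satisfy both criteria yet were equivalent to wt-GFP, which is why
particular residues are preferred at each position.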

Accordingly, in the present invention, specific mutations are introduced
into positions 64 and 65 of the wt-GFP cDNA sequence (SEQ ID NO:1).
Alternatively, increased expression of the present mutant GFPs may be obtained
by introducing the preferred mutations into a humanized GFP gene such as that
described previously (SEQ ID NO:3) (Evans, K., et al., FOCUS 18(2):40-43
(1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)).

Construction of GFP Mutants 

Preparation of GFP Plasmids

The wt-GFP may be cloned from its natural source, Aequorea victoria, as
described (Prasher, D.C., et al., Gene 111:229-233 (1992)). More preferably,
GFP cDNA to be mutated is contained within a plasmid construct or vector,
preferably an expression vector, suitable for use in transfecting mammalian cells,
such as pRAY-1 wherein the wt-GFP cDNA is under the control of the human
cytomegalovirus (CMV) enhancer/promoter (Marshall, J., et al., Neuron
14:211-215 (1995)). Most preferably, to provide for optimum expression of the
mutant GFPs in mammalian cells, the humanized S65T mutant GFP cDNA (Evans, K.,
et al., FOCUS 18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654
(1996)) under control of the CMV enhancer/promoter may be used, contained in
plasmid pGreenLantern-1 (Figure 5), which is available commercially from Life
Technologies, Inc. (Rockville, Maryland).

The above-described plasmids may be used directly for preparation of 
mutant GFP cDNAs according to the present invention. Alternatively, a stop 
codon in the 5' multiple cloning site of pGreenLantern-1 may be shifted out of
frame by oligonucleotide ligation methods to allow the mutant GFPs of the present 
invention to be used in the construction of fusions between GFP and other 
proteins, as described below. 

Mutations to GFP cDNA 

A variety of random or site-directed mutagenic techniques may be used to
prepare the mutant GFPs of the present invention. Appropriate methods include
chemical mutagenesis using, for example, sodium bisulfite or hydroxylamine
(Myers, R.M., et al., Science 229:242-247 (1985); Sikorski, R.S., and Boeke,
J.D., Meth. Enzymol. 194:302-318 (1991)), linker insertion mutagenesis (Heffron,
F., et al., Proc. Natl. Acad. Sci. USA 75:6012-6016 (1978)), deletion mutagenesis
(Lai, C.J., and Nathans, D., J. Mol. Biol. 89:179-193 (1974); McKnight, S.L., and
Kingsbury, R., Science 217:316-324 (1982)), enzyme misincorporation
mutagenesis (Shortle, D., et al., Proc. Natl. Acad. Sci. USA 79:1588-1592
(1982)), oligonucleotide-directed mutagenesis (Hutchinson, C.A., et al., J. Biol.
Chem. 253:6551-6560 (1978); Zoller, M.J., and Smith, M., Nucl. Acids Res.
10:6487-6500 (1982); Taylor, J.W., et al., Nucl. Acids Res. 13:8765-8785
(1985)), and cassette mutagenesis (Lo, K.-M., et al., Proc. Natl. Acad. Sci. USA
81:2285-2289 (1984); Wells, J.A., et al., Gene 34:315-323 (1985)). To improve
the fidelity and efficiency of mutagenesis, the use of the polymerase chain reaction
(PCR) in accomplishing GFP mutagenesis by one or more of the foregoing
methods is preferred (Higuchi, R., et al., Nucl. Acids Res. 16:7351-7367 (1988);
Leung, D.W., et al., Technique 1:11-15 (1989); Clackson, T., and Winter, G.,
Nucl. Acids Res. 17:10163-10170 (1989)).

Most preferably, mutations are made to GFP cDNA by uracil DNA
glycosylase (UDG) mutagenesis using PCR amplification (Nisson, P., et al., PCR
Meth. Appl. 1:120-123 (1991)). In this approach, the plasmid containing GFP
cDNA, most preferably pGreenLantern-1 comprising humanized S65T GFP
(Figure 5), is used as the PCR template, and a sense or antisense primer consisting
essentially of an oligonucleotide containing at least one mismatched nucleotide
(available commercially from Life Technologies, Inc.; Rockville, Maryland) is
added to the reaction mixture. Amplification reaction mixtures most preferably
contain 1X PCR buffer, about 10 micromolar each of deoxyATP, deoxyTTP,
deoxyCTP and deoxyGTP, about 25 picomoles each of sense and antisense
primers and about 10 nanograms of template. PCR is performed by techniques
that are routine in the art, and after at least five PCR cycles, samples of the
reaction mixture are treated with UDG, most preferably for 30 minutes at 37°C,
as described (Nisson, P., et al., PCR Meth. Appl. 1:120-123 (1991)).

The mutated GFP nucleic acid molecules preferably will comprise nucleic
acid sequences encoding mutant proteins in which one or more amino acid
residues have been mutated from the wildtype amino acid sequence set forth in
Figure 1 and SEQ ID NO:2. Such mutations may include, for example,
substitutions, deletions, insertions or modifications, and preferably are amino acid
substitutions. Particularly preferred are amino acid substitutions occurring in the
three amino acid chromophore of GFP at residues 64, 65 and 66 of the wildtype
GFP sequence (Figure 1 and SEQ ID NO:2), wherein the phenylalanine residue
at position 64 (Phe64), the serine residue at position 65 (Ser65), and the tyrosine
residue at position 66 (Tyr66), are each individually, or all together, replaced by
other amino acid residues. More preferred mutant GFPs of the invention include,
but are not limited to, those with the following substitutions from the wildtype
GFP sequence shown in Figure 1 and SEQ ID NO:2:

•serine 65 replaced by threonine (Ser65→Thr);
•Phe64→Cys and Ser65→Ala (SEQ ID NO:5);
•Phe64→Cys and Ser65→Thr;
•Phe64→Leu and Ser65→Thr;
•Phe64→Met and Ser65→Ala (SEQ ID NO:6);
•Phe64→Met and Ser65→Thr;
•Phe64→Met, Ser65→Phe and Tyr66→Phe;
•Phe64→Met, Ser65→Phe and Tyr66→Lys;
•Phe64→Thr and Ser65→Cys; and
•Phe64→Val and Ser65→Cys.

Other suitable mutations and mutant GFP amino acid sequences may be
determined by one of ordinary skill without undue experimentation according to
the methods described herein and others that are known in the art. As a practical
matter, whether a particular mutation or combination of mutations produces a
mutant GFP that may have the above-described desirable properties (e.g., higher
expression in mammalian cells, higher fluorescence intensity under UV or white
light illumination) may be determined by one of ordinary skill using the mutation,
transfection, expression and detection methods described in detail below in the
Examples, as well as using standard techniques that are routine in the art.

Following mutagenesis by any of the above-described methods, the
resulting nucleic acid molecules encoding the mutant GFPs may be inserted into
one or more vectors, such as those described above, which are preferably
expression vectors. A particularly preferred vector for containing the present
mutant GFP nucleic acid molecules is pGreenLantern-2 (Figure 6). Methods for
producing the mutant GFP-vector constructs will be familiar to those of ordinary
skill, and are provided in detail below in Example 1.

Once they have been constructed, the vectors comprising the mutant GFP
nucleic acid molecules may be formulated into a variety of compositions, such as
solutions (e.g., buffer solutions) to be used in transfecting host cells.
Alternatively, the vector constructs may be purified and stored according to
standard techniques for handling recombinant DNA plasmid vectors (Sambrook,
J., et al., Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring
Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 1.3-1.20 (1989)).

More preferably, the mutant GFP-containing plasmid vectors are
transformed into a competent host cell. Any competent host cell may be used,
including those of bacteria (e.g., E. coli), yeast (e.g., Saccharomyces spp.), insects
(e.g., Spodoptera spp.) and mammals (e.g., CHO or BHK cells), although a
competent strain of E. coli such as DH10B (Life Technologies, Inc.; Rockville,
Maryland) is most preferably used. Transformation of mutagenized GFP cDNAs
into host cells may be accomplished by any technique generally used for
introduction of exogenous DNA, including the chemical, viral, electroporation,
lipofection and microinjection methods that are well-known in the art. Particularly
preferred methods for transformation include electroporation and liposome-
mediated transfection (lipofection), the latter most preferably being accomplished
using LIPOFECTAMINE™ (Life Technologies, Inc.; Rockville, Maryland).

After expansion of transformed cultures, mutated GFP cDNA is isolated
from the host cells by routine methods (Sambrook, J., et al., Molecular Cloning,
a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor
Laboratory Press, pp. 1.21-1.52 (1989)) and is subcloned into a plasmid backbone
for use in subsequent transfections. Most preferably, this plasmid backbone is the
pGreenLantern-2 backbone (see Figure 6), which contains a universal sequencing
primer downstream from a CMV enhancer/promoter and an NsiI site immediately
upstream of the CMV promoter allowing excision of the promoter region, along
with XbaI, XhoI and HindIII sites in place of the 3' NotI site in pGreenLantern-1
(Figure 5).

Fusion sequences of GFP cDNA with nucleotide sequences encoding
proteins of interest may be prepared by cloning the desired sequence(s) into
pGreenLantern-2 at the 5' multiple cloning site using standard techniques. These
fusion constructs allow the use of the mutant GFPs of the present invention as
reporters of transfection efficiency. In addition, fusion constructs such as these
will allow a direct examination of the expression, biochemistry and localization of
the fused proteins intracellularly.

Alternatively, to examine the structure and function of regulatory 
sequences (e.g., promoters, enhancers, inhibitors) in native genes, the GFP mutant
cDNAs may be directly transfected or inserted, using routine methods, into target
genomic or extrachromosomal DNA sequences in host cells (Chalfie, M., et al.,
Science 263:802-805 (1994)).

Transfection of Hosts With GFP Mutants 

Target cells to be transfected with cDNAs comprising mutant GFPs (either
fused or unfused to accessory sequences) are grown and maintained in culture
according to routine methods. Cells may be transfected with mutant GFP cDNA
by any method described above, although electroporation or liposome-mediated
transfection (particularly using LIPOFECTAMINE™) is preferred. Following
transfection, cells are incubated for 12-48 hours, preferably 18-24 hours and most
preferably for about 24 hours. Transfected cells may then be examined for the
expression of mutant GFP, manifested as green intracellular fluorescence. With
standard optical filters routinely used for examining fluorescein (typically
excitation wavelength of about 475 nm, dichroic filter of 485 nm, emission
wavelength of about 490 nm), this fluorescence may be examined qualitatively, for
example by fluorescence microscopy, or quantitatively, for example by
spectrofluorimetry or flow cytofluorimetry. In addition, transfected cells
expressing relatively high amounts of mutant GFPs of the present invention may
be separated from non-transfected cells, or from those expressing lower levels of
GFP, by fluorescence-based single cell separation techniques such as fluorescence-
activated cell sorting. Alternatively, transfected cells expressing mutant GFPs that
fluoresce under white light illumination, particularly those having amino acid
sequences as set forth in SEQ ID NOs: 5 and 6, may be examined by the above-
described qualitative and quantitative methods using standard white light optics
(e.g., incandescent or halogen lighting, or sunlight).

These transfected host cells may also be used in methods for the 
production of mutant GFPs of the invention. Such methods may comprise, for 
example, culturing the above-described host cells under conditions favoring the 
production of the mutant GFPs by the host cells, and isolating the mutant GFPs 
from the host cells and/or the culture medium in which the host cells are cultured. 
Typical host cell culture conditions favoring production of recombinant proteins, 
such as the present mutant GFPs, are well-known in the art (see, e.g., Sambrook,
J., et al., Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring
Harbor, NY: Cold Spring Harbor Laboratory Press (1989)). The mutant GFPs
produced by these methods may then be isolated by any of a number of protein
purification techniques, such as chromatography (preferably affinity
chromatography, HPLC or FPLC), salt extraction (such as ammonium sulfate
precipitation), electrophoresis, dialysis, or a combination thereof, to produce
isolated mutant GFPs of the invention. These mutant GFPs may then be stored
until use (preferably at temperatures below 0°C, more preferably at about -20°C
to about -70°C), or they may be formulated into compositions. Preferred such 
compositions may comprise, for example, one or more of the mutant GFPs of the 
invention and one or more additional components, such as one or more buffer 
salts, one or more inorganic salts or ions thereof, one or more detergents, one or 
more preservatives, and the like, preferably in an aqueous or organic solvent. 

Detection Methods 

In additional embodiments, the invention relates to methods of detecting 
the presence of a mutant GFP, or of a cell (such as a prokaryotic or eukaryotic, 
including mammalian, cell) expressing a mutant GFP. Such methods of the 
invention may comprise, for example, illuminating the mutant GFP or cell 
expressing the mutant GFP with a source of white light under conditions such that
the mutant GFP or cell expressing the mutant GFP emits visible fluorescent light.
In the present methods, the illumination source may be any light source emitting 
white (i.e., visible) light, including but not limited to an incandescent light source, 
a fluorescent light source, a halogen light source, sunlight, and the like. When 
illuminated by such a white light source, mutant GFPs, such as those of the 
present invention, will emit fluorescent light of various visible wavelengths 
(depending upon the specific mutations contained in the mutant GFP, as described 
above), which may be detected by eye or by any of the above-described qualitative 
or quantitative mechanical means. 

Kits 

In other preferred embodiments, the compositions of the present invention 
may be assembled into kits for use in transfecting host cells with the nucleic acid 
molecules encoding the present mutant GFPs, or for labeling target polypeptides 
with the present mutant GFPs. Host cell transfection kits according to the present
invention may comprise at least one container containing one or more of the 
above-described nucleic acid molecules encoding a mutant GFP (or a composition 
comprising one or more of the nucleic acid molecules or plasmids described 
above), which nucleic acid molecule preferably comprises plasmid
pGreenLantern-2/A1 or plasmid pGreenLantern-2/A4 (see Example 1 below). These
transfection
kits of the invention may optionally further comprise at least one additional 
container which may contain, for example, a reagent for delivering the mutant 
GFP nucleic acid molecule into a host cell; in preferred kits, this reagent may 
comprise a liposome and most preferably LIPOFECTAMINE™. Polypeptide 
labeling kits according to the present invention may comprise at least one 
container containing, for example, a mutant GFP such as those described above 
(or a composition of the invention comprising a mutant GFP), which is preferably 
a mutant GFP having an amino acid sequence as set forth in SEQ ID NO: 5 or 
SEQ ID NO: 6. These labeling kits of the invention may optionally further
comprise at least one additional container which may contain, for example, a
reagent for covalently linking the mutant GFP to the target polypeptide. 

Use of Mutant GFPs 

The mutant GFPs and kits of the present invention may be used in a variety 
of applications. For example, the mutant GFP cDNAs are useful as reporter genes 
that allow a determination of transfection efficiency and success (Chalfie, M., et
al., Science 263:802-805 (1994)). Alternatively, the mutant proteins themselves
may be used as fluorescent labels suitable for detectably labeling other proteins,
nucleic acids or particulates to be used in a variety of applications (Heim, R., et
al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Yokoe, H., and Meyer,
T., Nature Biotech. 14:1252-1256 (1996)), such as labeling antibodies used in
infectious disease diagnostic methods; mutant GFPs may be attached to target
polypeptides and proteins by a variety of methods that are well-known to one of
ordinary skill in the art, including the use of chemical coupling reagents. In
addition, fusion complexes between GFP and other proteins may be constructed
to allow closer and more sensitive determinations of the expression, biochemistry,
localization and trafficking of intracellular proteins in many host cells (Heim, R.,
et al., Proc. Natl. Acad. Sci. USA 91:12501-12504 (1994); Wang, S., and
Hazelrigg, T., Nature 369:400-403 (1994); Marshall, J., et al., Neuron 14:211-215
(1995); Rizzuto, R., et al., Curr. Biol. 6:183-188 (1996)). Importantly, use of the
mutant
GFPs that emit fluorescence when illuminated by white light will spare the user 
considerable expense and technical difficulty that can accompany the use of 
fluorescent optics for the examination of fluorescent reporter genes such as GFP. 

It will be readily apparent to one of ordinary skill in the relevant arts that 
other suitable modifications and adaptations to the methods and applications 
described herein are obvious and may be made without departing from the scope 
of the invention or any embodiment thereof. Having now described the present
invention in detail, the same will be more clearly understood by reference to the
following examples, which are included herewith for purposes of illustration only
and are not intended to be limiting of the invention. 

Examples 

Example 1: Construction of Mutant GFP cDNAs 

Plasmids. As depicted in Figure 5, pGreenLantern-1 (Life Technologies,
Inc., Rockville, Maryland; catalogue no. 10642) contains the humanized S65T
mutant GFP cDNA (Figure 2; SEQ ID NOs:3, 4) (Evans, K., et al., FOCUS
18(2):40-43 (1996); Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)). This
plasmid serves as the source of the GFP DNA sequence used for mutagenesis. As
depicted in Figure 6, pGreenLantern-2 contains a universal sequencing primer
downstream of the CMV promoter along with an NsiI site immediately upstream
of the CMV promoter allowing excision of the promoter region. It also contains
XbaI, XhoI and HindIII sites in place of the 3' NotI site in pGreenLantern-1. A
stop codon in the 5' multiple cloning site of pGreenLantern-1 was shifted out of
frame to allow possible fusions to GFP in pGreenLantern-2.

Mutations to GFP cDNA by UDG cloning. PCR was performed in an MJ
Research DNA Engine™ thermal cycler using the following conditions: 94°C for
60 seconds, 94°C for 30 seconds, 55°C for 30 seconds and 72°C for 4 minutes,
repeated for 20 cycles. Sense oligonucleotide primers containing specific
mismatches to the wt-GFP sequence (SEQ ID NOs:7-15; Table 2) were obtained
from Life Technologies, Inc. (Rockville, Maryland).

Table 2. Sense Oligonucleotides Used for UDG Cloning Mutations.

Vector               Amino Acid            Single-Stranded Oligonucleotide            SEQ ID
                     Mutations             Sequence (5' to 3')                        NO:

pGreenLantern-2/A1   Cys64, Ala65          CAACACUGGUCACUACCTGCGCCTATGGCGTGC          7
pGreenLantern-2/A2   Cys64, Thr65          CCAACACUGGUCACUACCTGCACCTATGG              8
pGreenLantern-2/A3   Leu64, Thr65          CAACACUGGUCACUACCCTCACCTATGGCGTGCAGT       9
pGreenLantern-2/A4   Met64, Ala65          CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT    10
pGreenLantern-2/A5   Met64, Thr65          CAACACUGGUCACUACCATGACCTATGGCGTGCAGTGCT    11
pGreenLantern-2/A6   Met64, Phe65, Phe66   CAACACUGGUCACUACCATGTTCTTCGGCGTGCAGTGCT    12
pGreenLantern-2/A7   Met64, Phe65, Lys66   CAACACUGGUCACUACCATGTTCAAGGGCGTGCAGTGCT    13
pGreenLantern-2/A8   Thr64, Cys65          CAACACUGGUCACUACCACATGCTATGGCGTGCAGT       14
pGreenLantern-2/A9   Val64, Cys65          CAACACUGGUCACUACCGTGTGCTATGGCGTGCAGT       15


The antisense oligonucleotide primer used for each mutation set had the
following sequence: 5'-AGU-GAC-CAG-UGU-UGG-CCA-AGG-CAC-AGG-
GAG-CTT-3' (SEQ ID NO:16). The template plasmid used was pGreenLantern-1
(Figure 5) with a universal reverse sequencing primer incorporated into the
backbone. Amplification reactions contained 1X PCR buffer, 10 micromolar
deoxynucleoside triphosphates, 25 picomoles of each primer (sense and antisense)
and 10 nanograms of template DNA in a 50 microliter volume. After 6, 9 and 20
PCR cycles were completed, 10 microliter samples were taken and checked via
agarose gel electrophoresis for excess background. Two 20 microliter samples of
each 6-cycle aliquot were digested with DpnI at 37°C for 30 minutes, then at
75°C for 15 minutes and allowed to cool to room temperature. One of the
samples from each reaction (four samples in all) was treated with one unit of uracil
DNA glycosylase (UDG) at 37°C for 30 minutes (Nisson, P., et al., PCR Meth.
Appl. 1:120-123 (1991)). PCR samples were then transformed into 100
microliters of MAX Efficiency DH10B™ Competent Cells (Life Technologies,
Inc.; Rockville, Maryland). The mutated portion of the GFP cDNA was then
subcloned with a NotI and BamHI digest into the pGreenLantern-2 backbone
(Figure 6) which was not subjected to PCR (Sambrook, J., et al., Molecular
Cloning, a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring
Harbor Laboratory Press (1989)). This approach yielded nine separate mutant
GFP plasmid vectors, designated pGreenLantern-2/A1 through pGreenLantern-
2/A9 (Table 2), each with a specific mutation or set of mutations within the GFP
chromophore region at amino acids 64-66.
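
As a consistency check on Table 2, the codons at positions 64-66 carried by a
sense primer can be translated in silico. The sketch below is illustrative only:
the function name, the use of the CUGGUC motif (read here as codons 60-61) as a
reading-frame anchor, and the treatment of the dU residues as T are assumptions
made for this example and are not part of the procedure described above.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# ASSUMPTION: this motif encodes residues 60-61 in every Table 2 sense primer.
ANCHOR = "CTGGTC"


def chromophore_residues(primer: str) -> str:
    """Translate codons 64-66 of a sense primer into one-letter amino acid codes."""
    seq = primer.upper().replace("U", "T")  # dU base-pairs like T
    start = seq.index(ANCHOR) + 12          # skip codons 60-63 (4 codons x 3 bases)
    codons = [seq[i:i + 3] for i in range(start, start + 9, 3)]
    return "".join(CODON_TABLE[c] for c in codons)


# Primer sequences as listed in Table 2 (SEQ ID NOs: 7, 10 and 12).
primers = {
    "A1": "CAACACUGGUCACUACCTGCGCCTATGGCGTGC",
    "A4": "CAACACUGGUCACUACAATGGCCTATGGCGTGCAGTGCT",
    "A6": "CAACACUGGUCACUACCATGTTCTTCGGCGTGCAGTGCT",
}
for name, primer in primers.items():
    print(name, chromophore_residues(primer))
```

For these three primers the sketch yields CAY, MAY and MFF, that is, Cys64/Ala65
with Tyr66 retained, Met64/Ala65 with Tyr66 retained, and Met64/Phe65/Phe66,
in agreement with the mutations listed in Table 2.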

Example 2: Growth and Transfection of Host Cells With Mutant GFPs 
Cell Culture. Chinese hamster ovary cells (CHO-K1, obtained from
American Type Culture Collection (ATCC), Rockville, Maryland) were cultured
in D-MEM (4,500 milligrams/liter D-glucose with L-glutamine and phenol red)
plus 10% fetal bovine serum (FBS), 0.1 millimolar nonessential amino acids, 2.5
units per milliliter penicillin and 2.5 micrograms per milliliter streptomycin
(Freshney, R.I., Culture of Animal Cells: A Manual of Basic Techniques, 3rd Ed.,
New York: Wiley-Liss (1994)). Cells were grown at 37°C in a 5% CO2/air
incubator. All media and reagents were from Life Technologies, Inc., Rockville,
Maryland.

Transfection. CHO-K1 cells were plated at 2 x 10^ cells per well into
six-well (35 millimeter diameter) plates one day prior to transfection. Immediately
before transfection, cells were rinsed with medium containing no serum or
antibiotics. LIPOFECTAMINE™ reagent was diluted into 100 microliters of
OPTI-MEM I Reduced Serum Medium (without FBS) to give a final
concentration of LIPOFECTAMINE of 6 microliters per well. DNA was diluted
separately to a concentration of 1 microgram per well in 100 microliters of
OPTI-MEM I. Transfection complexes were formed by combining diluted lipid and
DNA and incubating for 30 minutes prior to addition to cells. Transfection
complexes were then diluted 1:5 with D-MEM containing no FBS or antibiotics
and added to the rinsed cells. Cells were transfected for five hours at 37°C, then
fed with an equal volume of D-MEM containing 20% FBS, 0.1 millimolar
nonessential amino acids, and no antibiotics. Cells were grown overnight at 37°C,
5% CO2/air. In some studies, cells were grown for 48 hours; in these studies,
transfection complexes were removed from cells 24 hours after addition and cells
were fed with 2 milliliters per well of complete medium.
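
The per-well volumes and the final serum concentration implied by this protocol
can be worked through as follows. This is a sketch under stated assumptions:
"diluted 1:5" is read here as one part complex in five parts total volume, and
all variable names are illustrative.

```python
# Per-well quantities taken from the transfection protocol above.
lipid_dilution_ul = 100.0   # LIPOFECTAMINE diluted in OPTI-MEM I
dna_dilution_ul = 100.0     # 1 microgram of DNA diluted in OPTI-MEM I
feed_fbs_percent = 20.0     # FBS content of the D-MEM added after five hours

# Diluted lipid and diluted DNA are combined to form the complexes.
complex_ul = lipid_dilution_ul + dna_dilution_ul

# ASSUMPTION: "1:5" means 1 part complex in 5 parts total volume.
transfection_ul = complex_ul * 5

# An equal volume of 20% FBS medium is added to the serum-free mixture.
feed_ul = transfection_ul
final_ul = transfection_ul + feed_ul
final_fbs_percent = feed_fbs_percent * feed_ul / final_ul

print(f"complex volume:  {complex_ul:.0f} uL")
print(f"volume on cells: {transfection_ul:.0f} uL")
print(f"final volume:    {final_ul:.0f} uL")
print(f"final FBS:       {final_fbs_percent:.0f}%")
```

Because the feed is an equal volume, the final serum concentration works out to
10% FBS however the 1:5 dilution is interpreted, which matches the serum content
of the culture medium described under Cell Culture.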

Regardless of the vector used, host cells transfected with the mutant GFP
genes demonstrated growth rates approximately equivalent to those of control cells
transfected with the wildtype GFP gene or with other reporter genes (e.g., β-gal).
These results indicate that transfection with the mutant GFP cDNAs of the present 
invention does not adversely affect the growth or culturability of the host cells 
more than transfection with any other reporter vector. 

Example 3: Characterization of GFP Mutants Expressed in Eukaryotic Cells 

Formalin Fixation. Transfected host cells were rinsed in Dulbecco's
Phosphate Buffered Saline (PBS), then fixed in a solution of 10% formalin in PBS
for one hour. Formalin was then removed, and cells were rinsed and stored in
PBS at 4°C until being analyzed.

Fluorescence Microscopy. Formalin-fixed cells were examined and 
photographed using an inverted phase contrast fluorescence microscope equipped 
with FITC filters (excitation 475 nm/dichroic 485 nm/barrier 490 nm) and a 50
watt mercury arc bulb at 1.25 volts. A 40X-power adjustable non-phase objective
was used for all micrographs, which were taken through blue, neutral and FITC 
filters using Kodak Ektachrome ASA 400 Daylight (for slides) or Kodak Gold 
ASA 400 Daylight (for prints). All exposures were for 12 seconds to allow 
unbiased comparison of fluorescence intensity. 

Flow Cytofluorimetry. Flow cytofluorimetry was performed on
transfected CHO-K1 cells that were trypsinized and suspended in PBS plus 10%
formalin at a concentration of less than 10^ cells per milliliter. Measurements were
made on a Coulter EPICS® XL-MCL flow cytometer using a 15 milliwatt argon
ion laser. Filters used were 488 nm excitation, 500 nm dichroic LP/525 nm band
pass for FL1 (green channel) and 575 band pass/600 nm dichroic LP for FL2
(orange channel). Samples consisted of 20,000 events using PMT voltages of 100
volts for side scatter and forward scatter, 496 volts for FL1 and 505 volts for FL2,
all with integral gain set to 1.0. Color compensation included 7.9% orange signal
in FL1 and 3.2% green signal in FL2.
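
The effect of the color compensation settings can be illustrated numerically.
The sketch below assumes simple pairwise subtractive compensation, in which 7.9%
of the FL2 (orange) signal is subtracted from FL1 and 3.2% of the FL1 (green)
signal is subtracted from FL2; this reading of the settings, the clamping of
negative values to zero, and the function name are assumptions for illustration,
and the instrument's own implementation may differ.

```python
from typing import Tuple


def compensate(fl1: float, fl2: float,
               orange_in_fl1: float = 0.079,
               green_in_fl2: float = 0.032) -> Tuple[float, float]:
    """Subtract the stated spillover fractions from each channel.

    ASSUMPTION: pairwise subtraction on raw (uncompensated) signals,
    with negative results clamped to zero.
    """
    fl1_comp = max(fl1 - orange_in_fl1 * fl2, 0.0)
    fl2_comp = max(fl2 - green_in_fl2 * fl1, 0.0)
    return fl1_comp, fl2_comp


# A strongly green event with a small orange reading (arbitrary units).
green, orange = compensate(100.0, 10.0)
print(f"FL1 (green): {green:.2f}, FL2 (orange): {orange:.2f}")
```

Under this reading, a bright green event loses less than 1% of its FL1 signal,
while roughly a third of its apparent orange signal is attributed to green
spillover and removed.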

Results. As shown in Table 3, the GFP mutants of the present invention
displayed varying intensities and kinetics of formation in transfected cells. Two
of these mutants, designated "A1" (phenylalanine mutated to cysteine at position
64; serine mutated to alanine at position 65; Figure 3; SEQ ID NO:5) and "A4"
(phenylalanine mutated to methionine at position 64; serine mutated to alanine at
position 65; Figure 4; SEQ ID NO:6) were exceptionally bright. As shown in
Figures 7-9, CHO cells transfected with plasmid pGreenLantern-2/A1 (Figure 7)
or with plasmid pGreenLantern-2/A4 (Figure 8) demonstrated a dramatic increase
in green fluorescence intensity over cells transfected with the humanized S65T
mutation of pGreenLantern-1 (Figure 9) when viewed at 24 hours post-
transfection using FITC optics.



Table 3. Effects of Point Mutations on GFP Fluorescence Intensity.

Vector               Amino Acids                Fluorescence Results

Wildtype GFP         Phe64, Ser65               λex = 395 nm (major), 470 nm (minor);
                                                48 hours required for detection
S65T                 Phe64, Thr65               6-fold increase in intensity over wildtype
pGreenLantern-1      Phe64, Thr65 (humanized)   22-fold increase in intensity over wildtype
pGreenLantern-2/A1   Cys64, Ala65               6-fold increase in intensity over S65T
pGreenLantern-2/A2   Cys64, Thr65               22-fold increase in intensity over wildtype
pGreenLantern-2/A3   Leu64, Thr65               6-fold increase in intensity over S65T
pGreenLantern-2/A4   Met64, Ala65               6-fold increase in intensity over S65T
pGreenLantern-2/A5   Met64, Thr65               Slight increase in intensity over
                                                pGreenLantern-1
pGreenLantern-2/A6   Met64, Phe65, Phe66        Equivalent to wildtype
pGreenLantern-2/A7   Met64, Phe65, Lys66        Equivalent to wildtype
pGreenLantern-2/A8   Thr64, Cys65               Equivalent to wildtype
pGreenLantern-2/A9   Val64, Cys65               Slight increase in intensity over
                                                pGreenLantern-1


Other mutants produced in the present studies were less satisfactory
(Table 3). For example, mutants A5 (phenylalanine mutated to methionine at
position 64; serine mutated to threonine at position 65) and A9 (phenylalanine
mutated to valine at position 64; serine mutated to cysteine at position 65) gave
only slightly better fluorescence than the humanized S65T mutation of
pGreenLantern-1. It is possible that the highly reactive cysteine at position 65 in
mutant A9 may interfere with the formation of the three amino acid heterocyclic
ring required for GFP fluorescence (Cody, C.W., et al., Biochemistry 32:1212-1218
(1993)).

Mutant A2 (phenylalanine mutated to cysteine at position 64; serine 
mutated to threonine at position 65) was equal in fluorescence to the humanized 
S65T pGreenLantern-1 (Evans, K., et al., FOCUS 18(2):40-43 (1996);
Zolotukhin, S., et al., J. Virol. 70:4646-4654 (1996)), while mutants A6
(phenylalanine mutated to methionine at position 64; serine mutated to 
phenylalanine at position 65; tyrosine mutated to phenylalanine at position 66), A7 
(phenylalanine mutated to methionine at position 64; serine mutated to 
phenylalanine at position 65; tyrosine mutated to lysine at position 66) and A8 
(phenylalanine mutated to threonine at position 64; serine mutated to cysteine at 
position 65) demonstrated a decreased fluorescence intensity and were, in fact, 
equivalent to wt-GFP. No shift in excitation or emission spectra was detected 
with these three mutants, however, as no fluorescence was observed using 
ultraviolet or rhodamine filter combinations. 

These results were also observed via flow cytometry. As shown in Figure 
10, CHO-K1 cells transfected with the A1 and A4 mutant GFPs demonstrated a
dramatic increase in fluorescence over wildtype and A6-A8 mutants within 24
hours of transfection. This high level of fluorescence was maintained, particularly
for cells transfected with the A4 mutant GFP, for at least 48 hours after
transfection (Figure 11).

Mutations at certain amino acid positions outside the chromophore were 
also examined for their effects on GFP fluorescence. Mutation of Gln69→Asn in 
the A4 mutant resulted in a dramatic decrease in fluorescence relative to the A4 
mutant itself, as did mutation of Val163→Ala and Ile167→Thr in the A4 mutant. 

Together, these results indicate that the most preferable mutations for 
providing highly fluorescent, rapidly expressed GFPs are those in which only one 
reactive amino acid is present at either position 64 or 65, as in the A1 
(Phe64→Cys; Ser65→Ala; SEQ ID NO:5) and A4 (Phe64→Met; Ser65→Ala; 
SEQ ID NO:6) mutants. 
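
The double substitutions discussed above can be written out at the sequence level. The following is a minimal Python sketch and is not part of the original disclosure: the helper name apply_point_mutations and the one-letter notation (for example "F64C") are illustrative assumptions. The starting sequence is the first 80 residues listed in SEQ ID NO:4 (Phe at position 64, Ser at position 65), transcribed into one-letter code. 

```python
import re

# First 80 residues of SEQ ID NO:4 in one-letter code (16 residues per row,
# matching the rows of the sequence listing).
GFP_N_TERM = (
    "MSKGEELFTGVVPILV"   # residues 1-16
    "ELDGDVNGHKFSVSGE"   # residues 17-32
    "GEGDATYGKLTLKFIC"   # residues 33-48
    "TTGKLPVPWPTLVTTF"   # residues 49-64 (Phe64 is the last residue)
    "SYGVQCFSRYPDHMKQ"   # residues 65-80 (Ser65 is the first residue)
)


def apply_point_mutations(seq, mutations):
    """Return a copy of seq with mutations such as "F64C" applied.

    Positions are 1-based, as in the text. Each mutation names the expected
    original residue, so a numbering mistake raises an error instead of
    silently changing the wrong residue.
    """
    residues = list(seq)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"unrecognized mutation: {mutation!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= pos <= len(seq):
            raise ValueError(f"position {pos} is outside the sequence")
        if seq[pos - 1] != old:
            raise ValueError(f"expected {old} at {pos}, found {seq[pos - 1]}")
        residues[pos - 1] = new
    return "".join(residues)


# A1 (Phe64->Cys; Ser65->Ala) and A4 (Phe64->Met; Ser65->Ala)
a1 = apply_point_mutations(GFP_N_TERM, ["F64C", "S65A"])
a4 = apply_point_mutations(GFP_N_TERM, ["F64M", "S65A"])
print(GFP_N_TERM[63:66], a1[63:66], a4[63:66])
```

The three-residue windows printed at the end cover positions 64 to 66 and can be compared by eye with the corresponding rows of SEQ ID NO:5 and SEQ ID NO:6. 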

Example 4: Characterization of GFP Mutants Expressed in Prokaryotic Cells 

To examine the efficacy of expressing mutant GFPs in prokaryotic cells, 
mutant GFP cDNAs were subcloned into the bacterial pProEX HTb vector 
(Figure 12). GFP cDNA was excised by NotI and XbaI digestion from 
pGreenLantern-2 (Figure 6) containing the mutations at positions 64, 65 and/or 
66 (mutants A1 through A9) shown in Table 3. The bacterial vector pProEX HTb 
(Figure 12) was also digested with the same enzymes. The pProEX HTb 
backbone and GFP fragments were ligated, to form the corresponding transfection 
vectors containing the respective mutant GFP fragments: pProEXA1, pProEXA2, 
pProEXA3, pProEXA4, pProEXA5, pProEXA6, pProEXA7, pProEXA8 and 
pProEXA9. These vectors were then individually transformed into 100 µl of 
DH10B E. coli host cells; control cells were also prepared that had been 
transfected with a construct containing the S65T mutant described in Examples 
1-3 above. Cells were plated onto ampicillin/IPTG plates and incubated overnight 
at 37°C, and colonies were then picked and screened for fluorescence under long 
ultraviolet (UV) or blue illumination. 

Colonies containing the A1, A2, A3, A4, A5, A9 and S65T mutant GFPs 
all demonstrated green fluorescence when illuminated with long UV or blue light, 
while those containing the A6, A7 and A8 mutant GFPs demonstrated no 
fluorescence under these conditions. These results are consistent with those 
observed in eukaryotic cells, as shown in Example 3 above, and indicate that 
mutant GFPs may be successfully transfected into and expressed in prokaryotic 
cells. 

Example 5: Visible Light Excitation of GFP Mutants 

To examine the ability of mutant GFPs to emit fluorescence when 
illuminated by white light, E. coli cells were transfected and plated as described 
above in Example 4. Colonies were then picked and examined for fluorescence 
upon illumination by incandescent light, fluorescent indoor lighting, or sunlight. 

Upon induction of the host cells with IPTG, cells transformed with the 
vector comprising the A4 GFP mutation unexpectedly exhibited bright green light 
emission under normal daylight conditions, without the need for excitation with 
UV light. Similar results were observed for cells transformed with the A3 mutant 
GFP. Cells containing the A1 and A5 mutant GFPs were also seen to be less (but 
still observably) fluorescent under white light illumination. Conversely, only very 
weak emission of light was observed under white light illumination in the cells 
transformed with the vectors comprising only the S65T, A2 and A9 mutations. 
Cells comprising the A6, A7 and A8 mutations exhibited no fluorescence when 
illuminated by white light. 

When plates containing these mutants were stored in the dark at 4°C for 
38 days, however, all of the colonies except those containing the A6, A7 or A8 
mutant GFPs were seen to be more intensely fluorescent under white light 
illumination. Colonies containing the A3, A4 and A5 mutants were more 
fluorescent under these conditions than were those containing the A1, A2, A9 and 
S65T mutants, although all colonies fluoresced more brightly than they did in 
freshly plated cells (i.e., when observed within 24-48 hours of transfection). 
When these plates were allowed to warm to room temperature, the fluorescence 
in colonies containing the A1, A2, A9 and S65T mutants decreased, while that in 
colonies containing the A3, A4 and A5 mutants remained brightly fluorescent. 

It is possible that the increased fluorescence observed in stored plates may 
have been due to accumulation of mutant protein in the cells over time in storage, 
indicating a dependence of white light fluorescence upon intracellular 
concentration of the GFP. To test this notion, a 6His-tagged A4 GFP construct, 
prepared and isolated by metal affinity chromatography according to standard 
techniques (see Ausubel, F.M., et al., in Current Protocols in Molecular Biology, 
New York: John Wiley & Sons, Inc., pp. 10.11.10-10.11.24 (1996)), was 
examined for fluorescence under blue, red and white light at various protein 
concentrations in solution. At a concentration of about 1.5 ng/ml, the purified A4 
GFP was brightly fluorescent under sunlight and fluorescent indoor white lighting, 
as well as under blue light; no fluorescence was observed, however, under red 
light. This highly concentrated A4 GFP solution became nonfluorescent upon 
boiling, but was at least slightly fluorescent up to a temperature of about 82°C. 
When diluted to 0.1 µg/ml, however, the A4 GFP solution fluoresced brightly 
under blue light (closer in wavelength to the excitation maximum of GFP which 
is in the UV range), but did not fluoresce under white light illumination. These 
results suggest that the increased fluorescence observed upon white light 
illumination of colonies stored for extended periods of time may be due to 
accumulation of GFP protein in the cells. 

Taken together, these results indicate that prokaryotic cells containing the 
A3 or A4 mutant GFPs, and to a lesser extent the A1 and A5 mutant GFPs, can 
emit light without the addition of an exogenous substrate or the use of ultraviolet 
irradiation. Use of these GFP constructs thus provides advantages over other 
visible light reporter vectors which require the use of exogenous substrates, and 
over other fluorescent reporter vectors which require UV irradiation which may 
induce undesirable mutations in the host cells. 

Example 6: Additional GFP Mutations 

To examine the effects of alternative point mutations on GFP fluorescence, 
mutations are targeted at the tryptophan residue at position 57 (the only 
tryptophan residue in the entire GFP molecule, which is located in the unique motif 
Pro-Val-Pro-Trp-Pro (SEQ ID NO:17)). To accomplish this mutation, 
oligonucleotides are designed to mutate Trp57→His or Trp57→Tyr, in 
conjunction with the Ser65→Thr mutant (SEQ ID NO:4) or the Phe64→Met; 
Ser65→Ala mutant (SEQ ID NO:6). These mutants are made in the bacterial 
vector pProEX HTb as described in Example 4, using specific oligonucleotides 
designed to provide the desired mutations. The vector constructs are then 
transfected into host cells and characterized as above for their fluorescence. 
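
As an illustration of the single-codon change that such an oligonucleotide would introduce, the following Python sketch swaps the tryptophan codon at position 57 in the first 64 codons of the coding sequence of SEQ ID NO:1. This is a hedged example only: the function name replace_codon is hypothetical, and the replacement codons CAC (His) and TAC (Tyr) are arbitrary choices from the standard genetic code, since the text above does not specify which codons are used. 

```python
# First 64 codons of the coding sequence in SEQ ID NO:1 (16 codons per row,
# as in the sequence listing).
CODONS = (
    "ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG "
    "GAA CTG GAT GGC GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG "
    "GGT GAA GGT GAT GCC ACA TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC "
    "ACC ACT GGA AAG CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC"
).split()


def replace_codon(codons, position, expected, new):
    """Return a new codon list with the codon at a 1-based position replaced.

    The expected codon is checked first so that an off-by-one position
    raises an error rather than mutating a neighboring residue.
    """
    if codons[position - 1] != expected:
        raise ValueError(
            f"expected {expected} at codon {position}, "
            f"found {codons[position - 1]}"
        )
    mutated = list(codons)
    mutated[position - 1] = new
    return mutated


# Assumed replacement codons: CAC encodes His, TAC encodes Tyr.
trp57_his = replace_codon(CODONS, 57, "TGG", "CAC")
trp57_tyr = replace_codon(CODONS, 57, "TGG", "TAC")
print(" ".join(trp57_his[53:58]))  # codons 54-58, the Pro-Val-Pro-X-Pro motif
```

Checking the expected codon before replacing it is a deliberate choice here, because residue numbering errors are easy to make when working from a printed listing. 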

In a similar fashion, mutations are made at other amino acid positions 
outside of the GFP chromophore region. For example, mutations are made at 
Arg96, which is probably responsible for stabilizing resonance structures of the 
imidazolidone 5-membered ring during ring formation and possibly during 
excitation, and is therefore a target for more rapid ring formation and, hence, 
faster detection of fluorescence. Mutations involving this residue include 
Arg96→His. 

Mutations are also possible at Phe46, which along with Phe64 separates 
the 5-membered chromophore ring from direct contact with the single tryptophan 
in the Ser65→Thr GFP (SEQ ID NO:4). By allowing direct hydrogen bonding 
between Trp57 and the ring structure, efficient energy transfer is possible as with 
the Phe64→Leu; Ser65→Thr mutant. Mutations involving this residue include 
Phe46→Leu or other hydrophobic residues that promote hydrogen bonding. 

Mutations are also made at Leu221 and Phe223, which are involved in 
dimer formation. Only three hydrophobic residues are in the dimer contact region; 
all others are hydrophilic. By mutating Leu221 and/or Phe223 to a hydrophilic 
or "neutral" residue such as glycine, GFP aggregation, which can be a problem 
with GFP fusion constructs, may be inhibited. 

Mutations are also made at His148, which probably stabilizes the 
fluorophore and forms hydrogen bonds with Tyr66 and Gln94. Mutations of 
His148 to a residue with a different charge or a different pKa are made to allow 
alteration of the excitation and emission spectra of GFP, similar to results seen 
with Tyr66→His which results in blue fluorescence by GFP. 

Finally, mutations introducing a second 5-membered ring structure into 
the α-helix of GFP are made, to allow increased fluorescence intensity of the 
resultant GFP. 
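
Because this example refers to many residues by number, it can be useful to confirm that each named residue agrees with the sequence listing. The short Python sketch below does this against SEQ ID NO:2, transcribed into one-letter code; the variable names and the dictionary of named residues are illustrative assumptions and are not part of the original disclosure. 

```python
# SEQ ID NO:2 in one-letter code, 16 residues per row as in the listing.
GFP = (
    "MSKGEELFTGVVPILV"   # 1-16
    "ELDGDVNGHKFSVSGE"   # 17-32
    "GEGDATYGKLTLKFIC"   # 33-48
    "TTGKLPVPWPTLVTTF"   # 49-64
    "TYGVQCFSRYPDHMKQ"   # 65-80
    "HDFFKSAMPEGYVQER"   # 81-96
    "TIFFKDDGNYKTRAEV"   # 97-112
    "KFEGDTLVNRIELKGI"   # 113-128
    "DFKEDGNILGHKLEYN"   # 129-144
    "YNSHNVYIMADKQKNG"   # 145-160
    "IKVNFKIRHNIEDGSV"   # 161-176
    "QLADHYQQNTPIGDGP"   # 177-192
    "VLLPDNHYLSTQSALS"   # 193-208
    "KDPNEKRDHMVLLEFV"   # 209-224
    "TAAGITHGMDELYK"     # 225-238
)

# Residues named in Example 6, keyed by 1-based position.
NAMED = {46: "F", 57: "W", 64: "F", 66: "Y", 94: "Q",
         96: "R", 148: "H", 221: "L", 223: "F"}

# Positions where the listing disagrees with the text (empty if all agree).
mismatches = {pos: (aa, GFP[pos - 1])
              for pos, aa in NAMED.items() if GFP[pos - 1] != aa}

# 1-based start of the Pro-Val-Pro-Trp-Pro motif (SEQ ID NO:17) and the
# positions of every tryptophan in the sequence.
motif_start = GFP.find("PVPWP") + 1
trp_positions = [i + 1 for i, aa in enumerate(GFP) if aa == "W"]

print(mismatches, motif_start, trp_positions)
```

A check of this kind only confirms internal consistency between the text and the listing; it says nothing about the effect of any mutation on fluorescence. 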

Having now fully described the present invention in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 
intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
as if each individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference. 



WO 98/21355 PCT/US97/21662 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Life Technologies, Inc. 

(B) STREET: 9800 Medical Center Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20850 

(ii) TITLE OF INVENTION: Mutants of Green Fluorescent Protein 
(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (To be assigned) 

(B) FILING DATE: 17-NOV-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US (To be assigned) 

(B) FILING DATE: 14-NOV-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/030,935 

(B) FILING DATE: 15-NOV-1996 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: gfplO 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..714 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

GAA CTG GAT GGC GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

GGT GAA GGT GAT GCC ACA TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

ACC ACT GGA AAG CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

ACC TAT GGC GTG CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG 240 
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

CAT GAC TTT TTC AAG AGC GCC ATG CCC GAG GGC TAT GTG CAG GAG AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

ACC ATC TTT TTC AAA GAT GAC GGG AAC TAC AAG ACC CGC GCT GAA GTC 336 
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTC GAA GGT GAC ACC CTG GTG AAT AGA ATC GAG TTG AAG GGC ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

GAC TTT AAG GAA GAT GGA AAC ATT CTC GGC CAC AAG CTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCC CAC AAT GTG TAC ATC ATG GCC GAC AAG CAA AAG AAT GGC 480 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

ATC AAG GTC AAC TTC AAG ATC AGA CAC AAC ATT GAG GAT GGA TCC GTG 528 
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

CAG CTG GCC GAC CAT TAT CAA CAG AAC ACT CCA ATC GGC GAC GGC CCT 576 
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

GTG CTC CTC CCA GAC AAC CAT TAC CTG TCC ACC CAG TCT GCC CTG TCT 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTG CTG GAG TTT GTG 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACC GCT GCT GGG ATC ACA CAT GGC ATG GAC GAG CTG TAC AAG 714 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

TGA 717 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: gfp (h) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..714 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
240 245 250 

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
255 260 265 270 

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
275 280 285 

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
290 295 300 

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG 240 
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
305 310 315 

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
320 325 330 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 336 
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
335 340 345 350 

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
355 360 365 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
370 375 380 

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 480 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
385 390 395 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 528 
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
400 405 410 

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT 576 
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
415 420 425 430 

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
435 440 445 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
450 455 460 

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 714 
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
465 470 475 

TAA 717 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys 
50 55 60 

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met 
50 55 60 

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg 
85 90 95 

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly 
145 150 155 160 

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val 
165 170 175 

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAACACUGGU CACUACCTGC GCCTATGGCG TGC 33 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCAACACUGG UCACUACCTG CACCTATGG 29 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAACACUGGU CACUACCCTC ACCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAACACUGGU CACUACAATG GCCTATGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAACACUGGU CACUACCATG ACCTATGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACACUGGU CACUACCATG TTCTTCGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CAACACUGGU CACUACCATG TTCA7W3GGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAACACUGGU CACUACCACA TGCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CAACACUGGU CACUACCGTG TGCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGUGACCAGU GUUGGCCAAG GCACAGGGAG CTT 33 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Pro Val Pro Trp Pro 
1 5 

WHAT IS CLAIMED IS: 

1. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence 
comprising an amino acid residue lacking an aromatic ring structure at position 64 
and an amino acid residue having a side chain no longer than two carbon units in 
length at position 65, with the provisos that 

if said residue at position 64 is leucine then said residue at position 65 is 
not cysteine or threonine; 

if said residue at position 64 is valine then said residue at position 65 is not 
alanine; 

if said residue at position 64 is methionine then said residue at position 65 
is not glycine; and 

if said residue at position 64 is glycine then said residue at position 65 is 
not cysteine. 



2. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 64 is selected from the group consisting of alanine, valine, 
leucine, isoleucine, proline, methionine, glycine, serine, threonine, cysteine, 
alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

3. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 64 is cysteine or methionine. 



4. The nucleic acid molecule of claim 3, wherein said amino acid 
residue at position 65 is alanine. 

5. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is selected from the group consisting of alanine, glycine, 
threonine, cysteine, asparagine and aspartic acid. 

6. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is alanine. 

7. The nucleic acid molecule of claim 6, wherein said amino acid 
residue at position 64 is cysteine or methionine. 

8. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence as 
set forth in SEQ ID NO:5. 

9. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence as 
set forth in SEQ ID NO:6. 

10. A mutant Green Fluorescent Protein having an amino acid 
sequence comprising an amino acid residue lacking an aromatic ring structure at 
position 64 and an amino acid residue having a side chain no longer than two 
carbon atoms in length at position 65, with the provisos that 

(a) if said residue at position 64 is leucine then said residue at position 65 
is not cysteine or threonine; 

(b) if said residue at position 64 is valine then said residue at position 65 
is not alanine; 

(c) if said residue at position 64 is methionine then said residue at position 
65 is not glycine; and 

(d) if said residue at position 64 is glycine then said residue at position 65 
is not cysteine. 

11. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is selected from the group consisting of alanine, 
valine, leucine, isoleucine, proline, methionine, glycine, serine, threonine, cysteine, 
alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

12. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is cysteine or methionine. 

13. The mutant Green Fluorescent Protein of claim 12, wherein said 
amino acid residue at position 65 is alanine. 

14. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 65 is selected from the group consisting of alanine, 
glycine, threonine, cysteine, asparagine and aspartic acid. 

15. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 65 is alanine. 

16. The mutant Green Fluorescent Protein of claim 15, wherein said 
amino acid residue at position 64 is cysteine or methionine. 

17. A mutant Green Fluorescent Protein having an amino acid 
sequence as set forth in SEQ ID NO:5. 

18. A mutant Green Fluorescent Protein having an amino acid 
sequence as set forth in SEQ ID NO:6. 

19. A host cell comprising the nucleic acid molecule of claim 1. 

20. A vector comprising the nucleic acid molecule of claim 1. 

21. The vector of claim 20, wherein said vector is an expression vector. 

22. A host cell comprising the vector of claim 20. 

23. A method for producing a mutant Green Fluorescent Protein, 
comprising culturing the host cell of claim 19 or claim 22 under conditions 
favoring the production of a mutant Green Fluorescent Protein, and isolating said 
mutant Green Fluorescent Protein from said host cell. 

24. A mutant Green Fluorescent Protein produced by the method of 
claim 23. 

25. The mutant Green Fluorescent Protein of any one of claims 10, 17, 
18 or 24, wherein said mutant Green Fluorescent Protein emits fluorescent light 
when illuminated by white light. 

26. A composition comprising plasmid pGreenLantern-2/A1. 

27. A composition comprising plasmid pGreenLantern-2/A4. 

28. A composition comprising the mutant Green Fluorescent Protein 
of any one of claims 10, 17, 18 or 24. 

29. A composition comprising the mutant Green Fluorescent Protein 
of claim 25. 

30. A kit for transfecting a host cell with a nucleic acid molecule 
encoding a mutant Green Fluorescent Protein, said kit comprising at least one 
container containing the nucleic acid molecule of claim 1. 

31. The kit of claim 30, wherein said nucleic acid molecule comprises 
plasmid pGreenLantern-2/A1 or plasmid pGreenLantern-2/A4. 

32. The kit of claim 30, further comprising at least one additional 
container containing a reagent for delivering said nucleic acid molecule into a host 
cell. 

33. The kit of claim 32, wherein said reagent for delivering said nucleic 
acid molecule into a host cell comprises a liposome. 

34. A kit for labeling a polypeptide with a mutant Green Fluorescent 
Protein, said kit comprising at least one container containing the mutant Green 
Fluorescent Protein of any one of claims 10, 17, 18 or 24. 

35. The kit of claim 34, wherein said mutant GFP fluoresces when 
illuminated by white light. 

36. The kit of claim 34, further comprising at least one additional 
container containing a reagent for covalently attaching said mutant Green 
Fluorescent Protein to a polypeptide. 

37. A method of detecting the presence of a mutant GFP comprising 
illuminating the mutant GFP with a source of white light under conditions such 
that the mutant GFP emits visible fluorescent light. 

38. A method of detecting the presence of a cell expressing a mutant 
GFP comprising illuminating the cell with a source of white light under conditions 
such that the mutant GFP expressed by the cell emits visible fluorescent light. 

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
20 25 30

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
50 55 60

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
100 105 110

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145 150 155 160

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
165 170 175

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
195 200 205

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
210 215 220

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys *
225 230 235

(SEQ ID NOs: 1, 2)

FIG. 1

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT  48
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
240 245 250

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG  96
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
255 260 265 270

GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC  144
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
275 280 285

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC  192
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
290 295 300

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG  240
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
305 310 315

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA  288
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
320 325 330

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC  336
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
335 340 345 350

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT  384
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
355 360 365

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC  432
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
370 375 380

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA  480
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
385 390 395

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT  528
Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
400 405 410

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT  576
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
415 420 425 430

GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG  624
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
435 440 445

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA  672
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
450 455 460

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA  714
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
465 470 475

(SEQ ID NOs: 3, 4)

FIG. 2

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
20 25 30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys
50 55 60

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
100 105 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145 150 155 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
165 170 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
195 200 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
210 215 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225 230 235

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
20 25 30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met
50 55 60

Ala Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
100 105 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145 150 155 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val
165 170 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
195 200 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
210 215 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225 230 235